Two-phase trace-driven simulation (TPTS): a fast multicore processor architecture simulation approach

نویسندگان

  • Hyunjin Lee
  • Lei Jin
  • Kiyeon Lee
  • Socrates Demetriades
  • Michael Moeng
  • Sangyeun Cho
چکیده

Simulation is indispensable in computer architecture research. Researchers increasingly resort to detailed architecture simulators to identify performance bottlenecks, analyze interactions among different hardware and software components, and measure the impact of new design ideas on the system performance. However, the slow speed of conventional execution-driven architecture simulators is a serious impediment to obtaining desirable research productivity. This paper describes a novel fast multicore processor architecture simulation framework called Two-Phase Trace-driven Simulation (TPTS), which splits detailed timing simulation into a trace generation phase and a trace simulation phase. Much of the simulation overhead caused by uninteresting architectural events is only incurred once during the cycle-accurate simulationbased trace generation phase and can be omitted in the repeated trace-driven simulations. We report our experiences with tsim, an event-driven multicore processor architecture simulator that models detailed memory hierarchy, interconnect, and coherence protocol based on the TPTS framework. By applying aggressive event filtering, tsim achieves an impressive simulation speed of 146 millions of simulated instructions per second, when running 16-thread parallel applications. Copyright q 2010 John Wiley & Sons, Ltd.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Moola: Multicore Cache Simulator

Chip multiprocessors have become the normative architecture for medium and high performance processors. These devices introduce new questions and research topics. One such topic is exploring the design space of a cachememory hierarchy that prevents the memory accesses from being a limiting factor on system performance. Simulation of system workloads is a widely accepted method for evaluating pr...

متن کامل

Efficient On-Line Trace Driven Simulation of Parallel Computer Architectures

This paper presents an approach for the synchronization of the trace generator and the target architectural and memory models in online, trace-driven simulations of parallel computer architectures. This approach eliminates the need for special concurrency control mechanisms conventionally used to achieve this synchronization, thus providing for simplicity and portability. The validity of the pr...

متن کامل

Empirical and Statistical Application Modeling Using On-chip Performance Monitors

ions are estimated resulting in execution time estimates fairly close to the genuine architecture. Many times trace-driven simulations accept compiled execution traces. These are particularly used for determining features of next generation architectures that use the same instruction set architecture. Direct simulation involves a gate-level simulation of architecture (usually in some kind of ha...

متن کامل

A Clustering Approach to Scientific Workflow Scheduling on the Cloud with Deadline and Cost Constraints

One of the main features of High Throughput Computing systems is the availability of high power processing resources. Cloud Computing systems can offer these features through concepts like Pay-Per-Use and Quality of Service (QoS) over the Internet. Many applications in Cloud computing are represented by workflows. Quality of Service is one of the most important challenges in the context of sche...

متن کامل

Summarizing multiprocessor program execution with versatile, microarchitecture-independent snapshots

Computer architects rely heavily on software simulation to evaluate, refine, and validate new designs before they are implemented. However, simulation time continues to increase as computers become more complex and multicore designs become more common. This thesis investigates software structures and algorithms for quickly simulating modern cache-coherent multiprocessors by amortizing the time ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Softw., Pract. Exper.

دوره 40  شماره 

صفحات  -

تاریخ انتشار 2010